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Abstract. This article is the second in a series of three papers, whose scope is to give new proofs 
to the well known theorems of Calderon, Coifman, Mcintosh and Meyer [1], [7], [8]. Here we 
treat the case of the Cauchy integral on Lipschitz curves and some of its generalizations. 



1. Introduction 

This paper is a continuation of [18] and it is the second of a three papers sequel. Let A be a 
Lipschitz function on the real line R. It defines a Lipschitz curve T in the complex plane, by the 
parametrization x — * x + iA(x). The Cauchy integral associated with this curve is the singular 
integral operator Cr given by 

C T f(x):=p.v. f —IQL-——dy. (1) 

JR (x-y) + i(A(x) - A(y)) 

The goal of this article is to give a new proof to the well known theorem of Coifman, Mcin- 
tosh and Meyer [8], which says that Cr extends naturally as a linear bounded operator from LP 
into LP for any 1 < p < oo. Moreover, the method of proof will also allow us to obtain several 
new generalizations of this important theorem, which will be described in the last section of the 
paper. 

As it is known, standard arguments reduce this problem to the problem of proving polynomial 
bounds for the associated Calderon commutators, defined by 

c d f(x) := p.v. J r (x _ y)d+l f(y)dy. (2) 

More precisely, it is enough to prove that 

\\C d f\\ P < C(d) ■ C{p) ■ \\f\\ p ■ \\A'\\t (3) 

for any f e L p , where C(d) grows at most polynomially in d. 

Simple and standard calculations, similar to the ones in [18], show that for a := A' and / 
Schwartz functions, (2) exists and can be rewritten as 



Jr' 1 X 



sgn(£ + a it i + ... + a<£ d )da\ ••■da a 



(4) 



As a consequence, C d can be seen as a ( d + 1)- linear operator. However, since the symbol 

l 
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sgn(£ + aii;\ + ... + a d % d )da y ...da d 



(5) 



[0,l] rf 



is not a classical Marcinkiewicz-Mihlin-Hormander symbol, there are no estimates for C d that 
can be easily passed to the multilinear theorem of Coifman and Meyer [7] and this is why 
proving (3) even without polynomial bounds, is a more delicate problem than an estimate on 
paraproducts. In [18], we gave a new proof of (3) in the particular case of the first Calderon 
commutator C\. The proof was based on the observation that even though is not 

a classical symbol, when smoothly restricted to Whitney squares (with respect to the origin) 
the Fourier coefficients of the corresponding functions decay at least quadratically. This fact, 
together with the logarithmical bounds for the shifted Hardy-Littlewood maximal functions and 
Littlewood-Paley square functions (also proved in [18]), were enough to reduce the problem to 
a setting where the method developed in [20] and [21] could be applied. It is on the other hand 
not difficult to realize, that even if one would assume that such quadratic estimates hold true 
in the general case of m d (^,^ u ...,£*), these observations alone would not be enough to obtain 
(3), since then one would end up summing 0(d) power series, which would finally generate 
an exponential upper bound of type C d . The main new idea to obtain the desired polynomial 
bounds in (3), is to realize that instead of treating m d as a whole multiplier of d+ 1 variables, one 
can see it as being a multiple average of various m\ type multipliers. In other words, throughout 
this paper, we will never need to go beyond the understanding of the symbol of the first Calderon 
commutator, to be able to obtain polynomial bounds for all the other commutators. This may 
seem surprising at the first glance, but could also be seen as an explanation of a somewhat 
similar observation of Verdera [24] who showed that in a certain sense, the Cauchy integral is 
dominated by the first Calderon commutator. 

Now, coming back to (3), we will prove the following 

Theorem 1.1. Let 1 < p\,...,p d +\ < oo and 1 < p < oo be so that \/p\ + ... + l/p d+ i = 1 1 p. 
Denote by I the number of indices ifor which p t ^ oo. Then, C d extends naturally as a (d + 1)- 
linear operator bounded from L P[ x ... x L Pd+l — > U with an operatorial bound of type 



where C(d) grows at most polynomially in d and C(pi) = 1 as long as pt = oofar 1 < i < d + 1. 

Assuming for a moment Theorem 1.1, we see immediately that our desired estimate (3) 
follows from it by taking p x = p and p 2 = ... = p d+ \ = oo. 

Let us remember now that since C d is a (d + 1)- linear operator, it has (d + 1) natural adjoints. 
To define them, recall the definition of the associated (d + 2)- linear form A d given by 



C(d)-C(i)-C( Pl )-...-C( Pd+l ) 



(6) 



JR 

Then, for every 1 < i < d + 1 one defines C*J by 




(7) 




(8) 



For symmetry, we also use the notation C d = C*J 
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To prove Theorem 1.1, we will show that for every 1 < i < d + 2 and for every (pi,..., 4>d+i 
Schwartz functions, one has 

IIC*''^!, ...,fa + i)\\ p < C(d) ■ C(l) ■ C(pi) ■ ... • C(p d+i ) ■ \\<f>i\\ pi • ... • \\<p Pd J\ d+ i, (9) 

where pj for 1 < j < d + 1 and p are as before. From this one can immediately extend C*J by 

density on an arbitrary product of LP> spaces, as long as 1 < p } < oo and on S°° spaces (the 
closure of the family of Schwartz functions in L°°) in the case when pj = oo. 

In the next section we will explain how one can then use duality arguments, to define C*J even 
further, to generic products of LP> and L°° spaces. These duality arguments will also clarify the 
necessity of proving the wider range of estimates which appear in Theorem 1.1 and (9) for C d 
and its adjoints. 

Acknowledgements: The present work has been partially supported by the NSF. 

2. Duality and the extension from S°° to L°° 

For any # = 0, 1, d denote by the statement that the inequalities (9) for C d and all its 
adjoints, extend naturally to the situation when at most # of the LP' spaces are equal to L°° and 
the rest are either S°° or correspond to an index j for which 1 < pj < oo. The goal would be 
to prove that S(d) holds true, assuming 5(0) (which we promised to prove later on). To show 
that 5(#) implies 5(# + 1), let us fix some indices 1 < pi, p d+l < oo as in the hypothesis of 
Theorem 1.1. Since the argument is completely symmetric (in particular all the adjoints can be 
treated similarly) we can assume without loss of generality that we want to extend (9) for C d , 
when the first # + 1 functions f\, f#+\ belong to L°° while the other 0# +2 , (pd+i are Schwartz 
functions. 

Case 1: p > 1. Since in this case (L p )* = LP' for l/p + 1/p' = 1, one can simply use duality 
and define C d (fi, 0#+2, 4>d+\) to be the unique LP function with the property that 

I C d (f l ,...,f# +l ,4># +2 ,...,4>d+i)(x)4>d+2(x)dx= I Cfifi, (f>#+2, ...,(f>d+i,4>d+2)(x)fi(x)dx 

for any Schwartz function 4>d+i with \\4>d+2 1 \ P ' = 1- This is clearly well defined as a consequence 

'd 



of5(#)forC* 1 



Case 2: p = 1. In this case one has to be a bit careful since this time the dual of L 1 is L°° 
and the Schwartz functions are no longer dense in it. However, we first observe that since 
p = 1, there must be at least two indices j\ and j 2 for which 1 < Pj^p^ < oo. Again, by the 
symmetry of the argument, assume that these indices are precisely # + 2 and # + 3. To define 
properly C d (fi, <p#+2, —^d+i) as an element of L 1 , we first observe that one can do this 

as an element of (say) L 2 , by taking advantage of the fact that all the functions (pj are Schwartz 
and therefore belong to all the L q spaces simultaneously, for 1 < q < oo. Indeed, one can for 
instance think of (p#+2,(p#+3 as being in L 4 while the rest of 7 all lie in S°° and then define 
Cdifu f#+\,<P#+2, <Pd+\) as being the unique function in L 2 with the property that 

I C d {f\,...,f #+ i,(p #+ 2,-,<Pd+\){x)(pd+2{x)dx= I C* d \f 2 ,...,f# +l ,(p# + 2,-,<Pd+i,<Pd+2)(x)Mx)dx 
«JR «JR 
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exactly as before, for any (p d+2 Schwartz function with H^^lb = 1> since one can rely again on 
S^forC* 1 . 

Now that we know that ...,f#+i, 4>#+2, —, <pd+i) is a well defined L 2 function, we would 

like to prove that it is in fact in L 1 , as desired. One can write, for any big M > 



\Cd(fi,-,f#+u<t>#+2, -,<Pd+i)(x)\ dx = I Cd(fi,...,f# + i,<p#+2, ...,<pd+i)(x)X[-M,M]( x )dx 
m JR (10) 

where \x { -m,m]( x )\ = X[-m,m]( x ) almost everywhere. 

Pick now a smooth and compactly supported sequence (f d+2 )n so that f% +2 — > X[-m,m]( x ) 
weekly and so that ||/? +2 ||oo < 1 (one can simply convolve X[-m,m] with a smooth approximation 
of identity, to obtain such a sequence). In particular, one can then majorize (10) by 

lim I Cd(Ju...,f# + i,<p#+2,-,<Pd+i)(x)fa +2 (x)dx < 
n IJR 

SU P I C d 1 (f 2 ,...,f# + i,(p# + 2,---,(pd + ufS + 2)( x )Mx)dx 

n UR 

and since now f' d \ 2 e S°° and ||/J +2 lloo < 1, one can again use the induction hypothesis to 
complete the argument. 

Also, a careful look at the whole duality procedure, shows that if we assume (9) with C(d) 
growing polynomially, then this will be preserved after replacing all the S°° by the correspond- 
ing L°°. 

We are thus left with proving (9) for Q and its adjoints. The advantage of it is that when 
applied to Schwartz functions, all the operators C*J for 1 < i < d + 2, are given by well defined 
expressions similar to (4). Later on, they will be furher decomposed and discretized carefully, 
and this will allow us to reduce (9) even more, to some similar estimates, but for finite and well 
localized model operators. 

3. Logarithmic estimates and discrete models 

In this section the goal is to describe some logarithmic estimates for certain very concrete 
discrete model operators, which will play an important role in proving the desired (9). 

In order to motivate them, and also to get a general idea of the strategy of the proof, let us 
assume for simplicity that instead of (9), one would like to prove LP x L°° x ... x L°° — > LP 
estimates (say) for a generic paraproduct n rf+ i(/i, ...,fd+\) whose (d + 2)- linear form is given 
by 

f * °iX*)-tfc + 2 * Vt +2 )(x)dx (1 1) 

JR k 

where f x e LP, fj e L°° for 2 < j < d + 1, while f d+2 e U' with l/p + l/p' = 1. 

As usual, all the functions {$> J k )k are smooth L 1 normalized bumps, adapted to intervals of 
the form [-2~ k ,2~ k ] for k e Z, and for at least two indices ji,j 2 one has Jr^ 1 (x)dx = 

(£>f(x)dx = (in which case we say that their corresponding families are of ¥ type, while 
the others are of O type). 
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We witness several situations. 

Case A: j\ = 1 and j 2 = d + 2. Let us also assume that the L 1 norms of the functions in the 
O families are not only uniformly bounded, but they are bounded by 1 . In particular, for any 
2 < j < d + 1 (in this case) one has 



|/y*OiU)|<||Oilli-||/y||co<||/j| 



One can then majorize (1 1) by 



(12) 



d+l p 

[]\\fj\\oo- J]\fl*®l(x)\\f d+2 *® d k +2 (x)\dx< 
y=2 JR k 

fj • Jr (z 1/1 * °^ w|2 ) • (z ^ * °" +2wi 



nwi--j R 



d+l d+l 

H ll/ylU • 115 (/OUp • \\S(f d+2 )\\ p , < \\ 11/,-IU • H/illp • H/^Hp, 

7=2 7=2 

as desired, by using the fact that the Littlewood-Paley square function S is a bounded operator 
on any L q space, for 1 < q < oo. 

Case B: 71 = 1 and j 2 = 2. This case will not be that simple. This time, one can only majorize 
(11) by ' 

d+l p 

n^ii-- z 1/1 * °^ ),|/2 * * orwi^ < 



7=3 



d+l ~ ( 

\\wnu- L Z^*°- 

j=3 JK \ k 



.1/2 



1/2 



• \J]\f2*^l(x)\ 

Y\\\fj\\oo- f S(f 1 )(x)S(f 2 )(x)M(f d+2 )(x)dx< 

7=3 JR 

d+l 

n • n5(/i)iu • iis(/ 2 )iu • iiw^2)n 4 < 



mp\f d+2 *® a k +z (x)\\dx 

k 



d+l 



7=3 



d+l 



nil/ylloo-ll/ilU-II^IU-ll/^ll,^ 



7=3 



for any 1 < s\ , s 2 ,s 3 < 00 so that 1 /si + 1 /s 2 = 1 /s 3 , by using the fact that besides S , the Hardy - 
Littlewood maximal function M is also bounded on any L q space, for 1 < q < 00. Clearly, the 
estimate we are looking for corresponds to s\ = S3 = p and s 2 = 00 and it cannot be obtained in 
this way, since S is unbounded on L°°. 
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If on the other hand one freezes the functions f 3 , f d+l , expression (11) becomes a 3- linear 
form and the above estimates show that its associated bi-linear operator H 2 (fi,f 2 ) is bounded 
from U l x U 2 into U 3 . By symmetry, the same is true for both n* 1 and n* 2 . The estimate we 
are interested in, can then be rephrased as 



n 2 :L p xL°° ^U. (13) 

To get it, one needs besides the previous Banach estimates to prove quasi-Banach estimates as 
well, of the form n* 2 : L n x L n — > U\ for any 1 < r\, r 2 < oo, < r 3 < oo with l/>i + l/r 2 = 
1 /r 3 . In the case of paraproducts, there are several ways to achieve that, see for instance [7]. In 
the end, one can use multi-linear interpolation between the Banach and quasi-Banach estimates 
as in [22], to obtain the intermediate (13). Even more precisely, the convexity argument in [22] 
shows that there exist two Banach and one quasi-Banach estimates (with implicit boundedness 
constants C B , C\, C q - B respectively) so that if one denotes by C B '■= maxjC^, C\, C q ^ B }, one has 
that this constant represents an upper bound for the boundedness constant of (13). 

Case C: j\ = 2 and j 2 = 3. Finally, we are left with this situation which can be treated similarly 
to the previous one. More precisely, one can majorize (11) this time by 



d+l r 
j=4 ^ R k 



d+i ^ r \ 



1/2 



|2 



1/2 



sup |/i * ® l k (x)\ 1 1 sup \f d+2 * Q>l +Z (x)\ I dx 



j=4 JR V k ) V k 

d+l „ 

\\\\fj\U- S(f 2 )(x)S(Mx)M(f 1 )(x)M(f d+2 )(x)dx< 

d+l 

["[ ll/;IU • IIWOIU • \\S(f 2 )\\ S2 ■ \\S(f3)\\ S3 ■ \\M(f d+2 )\\ s , < 

M 

d+l 

l^\\fj\L-\\Msr\\f2\\s 2 -\\f3\\ S3 ¥d+2h, 

for any 1 < s\, s 2 , 53, 5 4 < 00 so that 1/ s\ + l/s 2 + I/53 = I/J4. 

The estimate we are looking for corresponds to s\ = s 4 = p and s 2 = S3 = 00 and as before, 
it cannot be obtained in this way. 

This time one freezes the functions f 4 , ...,f d +\ and then expression (11) becomes a 4- linear 
form and the estimates above show that its associated 3- linear operator n 3 (/i,/ 2 , fa) is bounded 
from U l x U 1 x U 3 into U\ By symmetry, the same is true for its adjoints n* 1 , n* 2 and II* 3 . 
The estimate we are interested in, becomes 



n 3 : IT x L°° x L°° -> If. (14) 

As in the previous case, to get it, one needs besides the previous Banach estimates to prove 
quasi-Banach estimates as well, of the form II3 2 , IT3 3 : L n x L n x U 3 — > U\ for any 1 < 
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r u r 2, r 3 < oo, < r 4 < oo with l/r 1 + l/r 2 +l/r 3 = l/r 4 . And in the end, one can again use multi- 
linear interpolation between the Banach and quasi-Banach estimates, to obtain the intermediate 
(14). In particular, the same convexity argument shows that there exist two Banach and two 
quasi-Banach estimates (with implicit boundedness constants C B , C\, C l q _ B , C 2 q _ B respectively) 
so that if one denotes by C B '■= max{C^, C|, C q _ B , C 2 q _ B }, one has that this constant is an upper 
bound for the boundedness constant of (14). And this ends the discussion on the boundedness 
of from LP x L°° x ... x L°° into LP since by symmetry, it is easy to realize that any other 
possibility can be reduced to one of these cases 

There are a couple of important facts that one learns from the previous argument. First, the 
bounds are independent of d. Responsible for this is the assumption that the L 1 norms of the <D 
families are all at most 1, which implied the crucial (12). Then, there is the fact that after using 
(12) several times, we reduced our analysis to the study of several (Banach or quasi-Banach) 
corresponding estimates, for some minimal bi-linear or tri-linear operators. 

We claim now that in spite of the fact that Q is not a Coifman-Meyer operator, it can be 
studied in an analogous manner. More precisely, one can decompose it first into polynomially 
(in d) many paraproduct like pieces, and then estimate each piece independently of d. And 
also as before (since the Banach estimates are easy) we will reduce the main inequality (via 
interpolation), to similar quasi-Banach estimates for minimal I- linear operators, for some 1 < 
I < d+l. The proof of the precise quasi-Banach estimates is in general a delicate issue, but it 
has already been discussed in detail in [18]. 

The necessity to discretize the minimal (I + 1)- linear forms justifies the introduction of the 
following model operators. 

Fix then / a positive integer, n[, ...^arbitrary integers and consider families (Oj_)/, (OjL)/, 

••• X&i-)i of L 2 normalized bumps adapted to dyadic intervals % (as in [18], given /, denote 
by Ijp the interval of the same length as /, but sitting rij units of length |/| away from I) so that 
at least two of them are of *F type (i.e. their integrals are zero) 2 . Define the / linear discrete 
operator Tj for PJ a finite family of dyadic intervals, by 

Tj(fu..,fd = ^^ 2 {fu^)...{ftM^r- d5) 

One has 

Theorem 3.1. For any such a family PJ, the l-linear operator Tj maps LP X x ... x LP' — > LP for 
any 1 < p\, ...,pi < oo with \/p\ + ... + l/pi = l/p, < p < oo, with a bound of type 

0(\og <~n[> • log < ni >). 

Here, as in [18], <n]> simply denotes 2 + \n]\. Also, the implicit constants above are allowed 
to depend on I. 

ht should also be clear that a similar argument works in the general IL/+1 : LP 1 x ... x L Pd+ ' -» LP case. Instead 
of the minimal bi-linear or tri-linear operators which appeared before, one would have to deal with /- linear ones 
for some 1 < / < d + 1, but the interpolation between the natural corresponding Banach & quasi-Banach estimates, 
works precisely in the same way. 

9 1 1 

A smooth function is said to be adapted to an interval /, if one has |5 ff <I>(x)| < — d / (jj M , for any derivative 

a so that \a\ < 5 and any large M > 0, with the implicit constants depending on it. Then, if 1 < p < oo, we also 
say that \I\~ llp $> is LP normalized. 
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This theorem is the I- linear generalization of the bi-linear Theorem 2 in [18] and since its 
proof is identical to the proof of that theorem, we are leaving it to the reader. 

More precisely (as in [18]), Theorem 3.1 follows (by scale invariance and interpolation) from 
the more precise fact that for every fj e U> with \\fj\\ Pj = 1, 1 < j < I and measurable set E c R 
of measure 1, there exists a subset E' c E of comparable measure so that 

Z ITral^ 1 ' ^ 1 -" 1 ^' ^!!^!' /" 1 )! * lo § < "T > •- • lo § < ^ >. (16) 
where =^ £ >. 

As in [18], the fact that one looses only logarithmic bounds in the estimates above, will be 
important later on. 

In the rest of the paper we will describe the calculations that are necessary to show how the 
desired (9) can be indeed reduced to (16). 

4. Reduction to the discrete model 

We treat the case of Cd only, since by the symmetry of the argument, all its adjoints can be 
understood in a similar way. Fix then indices pj for 1 < j < d + 1 and p as in the hypothesis 
of Theorem 1.1. As suggested before, the first step is to decompose Q into polynomially (in d) 
many paraproduct like pieces which will be analized afterwords. 

Here, the clasical Littlewood-Paley decompositions will be of great help. However, since we 
want to have the perfect inequalities (12) available, we need to work most of the time with non- 
compact (in frequency) approximations of identity, which will cause several technical dificulties 
later on. We define them in detail in the next subsection. 

Non-compact Littlewood-Paley L 1 normalized projections. Start with a Schwartz function 
O(jc) which is even, positive and so that <$>(x)dx = 1 . Define also ^(x) by 

x P(x) = 0(x)-l/20(x/2) 

and observe that x V{x)dx = 0. Then, as usual, ^(x) and <S> k (x) denote 2 kx ¥(2 k x) and 2*0(2 fe x) 
respectively. Notice that all the L 1 norms of are equal to 1. Observe also that one has 

Y t (*) = - <&*_!(*), 

and then it is easy to see that 

z>=% 

k<k 

and also that 



or equivalently 



Z^ = ^o (17) 
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= 1 (18) 

keZ 

for almost every £ e R. On the other hand, 

?(0) = | =1-1=0. 
JR 

Moreover, since one has that 

?'(£) = -2™ f ^(x)e" 2mj:f ^ 
JR 

and in particular 

?'(0) = -2m" f x x ¥(x)dx = 
JR 

by using the fact that O is an even function. As a consequence, one can write ¥(£) as 

?(£) = ^) (19) 

for another smooth and rapidly decaying function (p. 

These are the non-compact L 1 normalized Littlewod-Paley decompositions. The compact 
ones are obtained in a similar manner, but one starts instead with a Schwartz function O having 
the property that supp <J> c [-1, 1] and O(0) = 1. 

Some remarks on the symbols of Cj for d > 2. Before going any further, it is worthwhile 
to have a look at the symbol of the second commutator C 2 . It is very natural to try to see if 
its Fourier coefficients satisfy the same quadratic estimates (proved in [18]) as the symbol of 
C\. Consider for instance three Schwartz functions <p(£), 0(£i) and 0(^2 ) supported inside the 
intervals [-2, -1], [1,2] and [-1/2, 1 /2] respectively. Clearly, the function 

is supported inside a Whitney cube (with respect to the origin) in R 3 and the goal is to under- 
stand the expression 



f 3 f l^ + a ^+^2)dadj3 
JR LJ[o,i]2 + 

-2nint; -Initial -2nin 2 & 



vitMtom) ■ (20) 



when n,ni,n 2 are arbitrary integers. Since %\ can never be zero, the symbol in (20) can be 



rewritten as 



f T C lR + (Z + a+ ff2)dad{3. (21) 
Jo si Jo 



When one differentiates (21) with respect to %\ the inner term becomes 
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l R+ (£ + £+/^ 2 )46 

which coincides to + The important difference now is that since £ + £1 lies inside 

the interval [—1,1] and £ 2 inside [-1/2, 1/2] and they both contain the origin, one can no longer 
continue to apply the argument in [18]. 

As a consequence of this particular example, the Fourier coefficients in (20) seem to decay 
only linearly, which is clearly not enough. In any event, these comments show that passing 
from the analysis of the first commutator to the analysis of the second one and all the rest, is not 
at all a routine task. One should recall that there are ten years difference between the results of 
Calderon [1] and the ones of Coifman and Meyer [6]. 

On the other hand, this also shows that from this point of view at least, the symbol of C 2 looks 
similar to the symbol of the bilinear Hilbert transform (given by sgn(£i - £2)) whose Fourier 
coefficients decay also only linearly as one can easily check. This fact might also be considered 
as another possible explanation of why Calderon suggested the study of the bilinear Hilbert 
transform as a step towards understanding all his commutators, besides the one recalled already 
in [18]. 

The generic decomposition of C d . Coming back to our goal, notice first that because of (4) if 
/, f\, f d+ \ are all Schwartz functions, one can write the (d + 2)- linear form A d (f, f u f d+i ) 
associated with C d as 

(22) 

By combining several Littlewood-Paley decompositions as in (18), one can write 

1 = Yj ^a^^O-^^fci). (23) 

ko,ki,...,kd,kd+i£% 

Now, for every (d + 2)- tuple (ko,k u ...,k d ,k d+x ) e X d+2 , one has either k > h, ...,k d ,k d+ \ or 
k\ > ko, k 2 , k d+ i ... or k d+ \ > k , k\, k d . By replacing some of these inequalities with strict 
inequalities, one can also make sure that the corresponding d + 2 regions in Z d+2 axe all disjoint. 

Fixing always the biggest parameter and summing over the rest of them, one can rewrite the 
constant 1 in (23) as 3 

J] ^)^i)...^(6)5I(£m)+ 

k 



I 




1r + (£ + <*i£i + - + a d ^ d )da 1 ...da d 



J] O yt (^)0,(^ 1 )...O yt (^m(^ + i). (24) 

k 

There are d + 2 inner terms in the decomposition (24) each containing a single ¥ type of a 
function. 

To be totally rigorous, some of these functions should be but this is a minor issue. 
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For some technical reasons that will be clearer later on, we assume that for the £ and 
variables we use compact Littlewood-Paley decompositions, while for the rest the non-compact 
one in (18). 

Let us assume now that in addition, one has £ + £1 + ... + g d+ i = 0. Look at the second (for 
instance) sum in (24) and consider the k = inner term which we write for simplicity as 



0(OT&)...0(&)0(& +1 ). (25) 
Since from (19) we know that ¥(£1) = %\<p(%{), one can rewrite this as 



TO) = - & - - - & + i) = -f - f - - - f i)- 

(26) 

Using this in (25), one can write it as another sum of 0(d) terms, containing this time two 
functions of *P type, since besides £x0(£i), one obtains in addition either expressions of type 
&0(6)for; = 2,...,d+l,or£0(£). 

If one does this for every scale k e Z and every inner term in (24), one obtains a decom- 
position of l{f+f 1 +...+& +1 =o} as a sum of 0(d 2 ) expressions whose generic inner product terms all 
contain two functions of *F type, which are more specifically of the form y0(y) 4 . If one inserts 
this into the formula for the (d + 2)- linear form (22), one obtains 0(d 2 ) (d + 2)- linear forms 
which will be carefully analized next. This is our generic decomposition. To be able to go 
further, one needs to understand how to unfold the symbol of C d . 

As before in the case of paraproducts, the positions of the functions (we denote them by 
ji, j 2 for < ji, j 2 < d + 1) will play an important role. There are in fact three distinct cases, 
depending on where these *F functions lie. 

Case 1: ji = and j 2 = 1. For symmetry, we change notations and rewrite the (d + 2)- linear 
form as 



X [( l R +(£ + aift + ... + a d g d )da 1 ...da d ] • (27) 

^ Jf+fi+...+& + i=0 J[0,l] d 

oo^)oi(^i)...of(^)of x (^ + o • mmi)..^^ d+ i)d^...d^ d+l 

with both families (Oj!)* and ($>{)k being of ¥ type. Without the C d symbol above, the expression 
(27) would be the (d+2)- linear form of a paraproduct, which could be analyzed as we described 
earlier. The first impulse to deal with it, is to try to decompose it into multiple Fourier series, 
on the support of the corresponding Whitney frequency boxes. However, it is clear that this will 
produce in the end an upper bound of the type of a product of 0(d) power series, which will 
grow exponentially in d even in the case of classical symbols, so one has to be very careful at 
this stage The situation is in fact even worse as we pointed out a bit earlier, since it seems that 
the Fourier coefficients of the symbol of C d for d > 2, do not decay quadratically as the ones of 
the symbol of C\ . 



And as a consequence, the Schwartz functions whose Fourier transforms are given by such expression, have 
integral zero. 
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The idea now is to realize that the variable £\ (in this case) is in some sense special and 
the right thing to do is to look at the C d symbol as being a multiple average of C\ symbols 
(depending on £1 and on a new variable £), which can be analysed as in [18]. 

To be able to execute this plan, since the functions ^> ] k (^j) for 1 < j < d do not have compact 
support, one has to insert yet two other compact Littlewood-Paley decompositions of the unity 
in (27). More precisely, denote by £ = £ + a 2 ^2 + ... + ad^d and write 

i = ^^im^o = J] ••• + Z ••• + Z - (28) 

ko,ki ko«k\ ko~Ki k(i»ki 

which can be rewritten as before as 



y o r (^^ r (^) + y y^y^) + y ^o^o. 



(29) 



To be totally rigorous, one should have had O r _ 10 o instead of <J> r in (29) and also finitely many 
middle *P - ¥ terms, instead of only one. We should keep that in mind but leave it as that, for 
the simplicity of our notation. 

As a consequence, if we insert (29) into (27), it splits as a sum of three distinct expressions 
which will be analized separately. We will denote this cases by l a , l h and l c respectively. 

Case l a . To see the effect of the new splitting (over r) in (27), let's analize for simplicity the 
particular term corresponding to k = 0. If we ignore the symbol J [Ql]d lp+(£ + + ■■■ + 
ad£,d)da\...da,d for a while, the rest of the expression becomes 



I 

r 



[o r (£)¥ r (£o] o^o^o.-.o^o^+i) 



(30) 



r<0 



r>0 



Case Y a . Using the fact that *P r (^i) is compactly supported and taking also into account the fact 
that Oq(^) is also of *P type (more precisely, as we have seen, it is of the form %i<p{i;\)), one can 



rewrite \' a as 



I 2 ' 

r<0 



r<0 



(d;j(^K^i)...^(6)^ +1 (&i)^) 



for certain compactly supported well chosen functions O r (^), *Fj(£i) and ^(£0 (naturally, the 
first is of <D type, while the other two are of type). 
In particular, we can split the symbol 



i 

Jo 



lR + (£ + ai£i¥<*i 



as a double Fourier series of the form 
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n,n\ 



where 



which is an expression independent of r. 
Recall now from [18] the crucial fact that 



\C r n , ni \(= \C n , ni \)< 



1 1 



< n > 2 < n x > # 

for any large number # > 0. 

These calculations show that the corresponding contribution of \' a in (27) is 



r<0 n,n\ 



f 



<^ +1 (& + i)fcr(ft/Wi(?i)»^ 

If one fixes now a 2 , a d e [0, 1], r and n, n\, the inner expression becomes 



f+fi+...+&+& + i=0 



O r (£ + a 2 & + - + a4 d )d^...d^ d+ i. 

We need now the following 

Lemma 4.1. If F, F\, F d+l and O are Schwartz functions, then one has 

I F(^)Fi(^ 1 )...F^" 1 (^ +1 )0(o^ + oi^i + ... + a d+l % d+l )d%d% l ...d% d+x 

J^!+...+fo+&l=0 



- at)Fi(x - a\t)...F d+ i{x - a d+ it)<b(t)dtdx, 



for every real numbers a, a\, a d +\. 
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Proof The formula is based on the following fact. If F is a vector subspace of R d+2 and 5 r 
represents the Dirac distribution associated to it and defined by 

6r(<f>) = £(f>(y)dy, 

for every Schwartz function <p then, 6 r = 8 r ^. In our case 

T = e R rf+2 : £ + + ... + & + & +1 = 0}, (36) 

and as a consequence T 1 - is the one dimensional subspace along the vector (1, 1, 1). Using 
this and Plancherel, the left hand side of (35) can be written as 

f F(g)F\ (£i ) . . .FrfTi (%d+ 1 )5>(a£ + + ... + a d+ ^ d+ {)^^ + "^d^^..^ d+l dx. 
JR 

If one adds to it 

0(a£ + aifi + ... + flrf+ifo+i) = f <D(0e- 2OTVW+aift+ - +ad+lfd+l) ^, 

JR 

one immediately obtains (35), by using Fourier's inversion formula several times. ■ 
We record also the following generalization of (35), which will be used later on as well 

I F{g)F\ (£i ) . . .F^Ti {%d+ i )5>i (a£ + oi^i + ... + fld+i^+O^W + fcifi + - + b d+l ^ d+l )d^ l ...d^ d+l = 
Jr (37) 



F(x - at - bs)Fi(x - ait - b\s)...F d+ i{x - a d +\t - a d+ is)Q>i{t)^ 2 {s)dtdsdx. 



Now, if G is an arbitrary Schwartz function and a a real number, we denote by G a the function 
defined by 

= G{Oe 2 ^. (38) 

Equivalently, one also has G a {x) = G(x + a). 

Using this notation and applying (35), our previous (34) becomes 

f* n 1 ^ na j 

2 (f * ®° h(x ~ t)(fi * Vl'^ix) Y\(fj * - ajt)(f d+1 * ®i+ l ){x)<b r {t)dtdx = 

JR j=2 

f* n n \ ^ na j 

2 (f * O ^Xx ~ t/ZXfi * Vr'*)(.x) Y\(fj * - ajt/2 r )(f d+l * ® d +l )(x)<I>o(t)dtdx = 

JR ;-o 



j=2 
d 



f 2 (f * * ^Xx) f\(fj * K 2,> 'X^Cfc+i * < l )(x)<S> {t)dtdx 

JR ;-o 



7=2 (39) 
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If one performs a similar decomposition for an arbitrary scale k ± now, the analogous formula 
of (39) becomes 



f 2 (f * <J>°'^)«C/i * y r T)(x) U(fj * ®t^)(x)(f d+ i * ® d k +1 )(x)a>o(t)dtdx. 

JR 7=2 (40) 

Summarizing everything, if one denotes by a = (a 2 , ar d ), one sees that the piece of C d that 
corresponds to Case Y a , can be written as 



. (n-t)aj 



J[0,i]^JR^ J 



<S>o(t)dtd3, 



(41) 



where naturally c r f' nua '' is the operator whose (J + 2)-linear form is given by the sum over k of 
the corresponding inner expressions in (40). 

Clearly, in order to prove (9) for (41), one would need to prove it for c r j n ' n, ' a ' t with upper 
bounds that are summable over r,n,n x and integrable over t and a. These operators c r : n ' nua '' 
are essentially paraproducts and for them one can apply the argument described in the previous 
section. However, the presence of all of these parameters, have the role to shift its ingredient 
functions a little bit, so this time one has to be very precise when evaluates the size of the 
boundedness constants. As before, the idea is to apply the perfect corresponding (12) to all the 
indices 2 < j < d for which pj = oo. This is possible because of the non-compact Littlewood- 
Paley L 1 normalized decompositions that have been used. Denote by S the set of indices 2 < 
j < d for which pj ^ oo. Now, if / = |5 1 + 2 and one freezes as before the L°° normalized 
Schwartz functions corresponding to the indices in {2, d} \ S , one obtains a minimal /-linear 
operator denoted by C l f'"'" lA 

Banach estimates for cJ r '"'" lA '. Fix indices 1 < si,...,si+i < oo so that l/si + ... + 1/si = l/s/+i. 
As in the previous section, the boundedness constants for 



depend on the boundedness constants of the following two square functions 



(42) 



z 

V k 



2\ 



1/2 



and 



V k 



.1/2 



and of several maximal functions of type 



sup 

k 



. (n-t)aj 



for j e S . 

It is not difficult to see that the square functions are the continuous analogue of the shifted 
discrete square functions S [S ^ ] and S Hl of [18] and as a consequence, they are bounded on every 
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L q space for 1 < q < oo, with upper bounds of type 0(log < [^4j >) and 0(log < n\ >) 
respectively, see [18] 5 . 

Likewise, the maximal functions are bounded by the shifted maximal functions M [ ^~ ] and 
therefore bounded on every U space for 1 < q < oo, with bounds of type 0(log < [ - 2 f - ] >), 
see again [18]. 

Using all these facts, one sees that the boundedness constants of (42) are no greater than 

C <r> 1 (log < n >/(log < m >/(log < [t] >)'. (43) 

Quasi-Banach estimates for c l f'"' nua '' '. Fix indices 1 < r 1; r t < oo and < r M < oo so that 
1/ri + ...1/ri = l/r/+i. We would like to estimate this time the boundedness constants of 

C lm4,t . L n x x L n ^ L r i+1 (44) 

and its adjoint operators. To achieve this, we will have to discretize the corresponding (40) even 
further (with respect to the x variable), to be able to rewrite the operator c l f' n ' n[,a,t in a form 
similar to (15), for which one can apply (16). One has first to observe that the bump functions 
corresponding to the index 1 in (40) are adapted to scales which are 2~ r times greater than the 
scales of the bump functions corresponding to the other indices. This fact suggests that the 
natural thing to do is to discretize using the bigger scale. On the other hand, one also observes 
that if a generic function O is a bump adapted to the dyadic interval J, and if J Q J is another 
dyadic interval 2~ r times greater that J, then 2 5r O is a bump adapted to J as well (5 corresponds 
to the number of derivatives in the definition of adaptedness). 

These facts, together with standard averaging and approximation arguments of [18] (includ- 
ing Fatou's lemma, etc.) show that our problem can be reduced to estimating expressions of 
type 

z i7p^ K/ ' °u |k/i ' < » n ° w iK/rf+i ' (45) 

where the functions /, (fj)j are as in (16) and (pj)j there are the same as our (rj)j here 6 . Using 
(16) and interpolation, we deduce that the boundedness constants of (44) are no greater than 

2- 6 ""(log < n >/(log < ni >)'(log < [t] >)', (46) 
and the same is true for all the adjoints of the operator. 

The final interpolation. Fix now indices p, (pj)j as in (9). Given that the desired estimates 
are on the edge of the Banach region, one can first use convexity arguments and linear interpo- 
lation only, to obtain many quasi-Banach estimates whose bounds do not grow too much with 
respect to r (at a rate of at most 2 ~ er say, for some small e). Then, one can use the multilinear 
interpolation theory from [22] and interpolate between these better quasi-Banach estimates and 

5 For every real number y, we denote by [y] its integer part. 

6 The power 6 above should be read as 5 + 1, where 5 comes from the adaptedness argument and 1 is a conse- 
quence of scaling, in particular all the bump functions are L 2 normalized now. 
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the previous Banach ones in (43), to realize that (9) for c r f' ni ' a ' t comes with a bound which is 
acceptable by (41). 

This completes the discussion of Case \' a . 

The rest of the cases follow a similar strategy. As one could observe, besides the qua- 
dratic/logarithmic argument, the presence of the decaying factor 2 r in (41) was also crucial. 
In the remaining of the paper we shall describe the adjustments that one sometimes needs to 
make in the other cases, in order for the above argument to work. 

Case 1". The 1" part, corresponding to r > is actually simpler, since this time the interaction 
between ^(^0 and ®q(£i) gives 

for some large constant M > 0, where ^(^i) is another *F function adapted to the same scale as 
*F r (£i). This huge decaying factor together with a similar argument as before, solve this case as 
well. 

Case 1/,. This is very similar to \ a . In fact, the only difference is that this time the corresponding 
Fourier coefficients can be estimated by 

1 1 1 1 

< n > 2 < n - n x >* < n > # < n x > # 
as shown in [18] and this still gives a contribution summable over n,n\. 

Case l c . Here, one has first to realize that on the support of W r {^)Q> r {^{) the symbol lp+(£ + 
a x t;i)dai behaves like a classical Marcinkiewicz-Hormander-Mihlin symbol and as a conse- 
quence, one has perfect decay for its Fourier coefficients, of type ^# ^pr- 

There are two subcases Y c and 1", which correspond as before to r < and r > respectively. 

Case l' c . In this situation, one just has to observe that 

O r (ft)oJ(fr) = Ortftfrtfai) = = 2 r 0^i)| = 2^(6) 

where ^(^0 is also of ¥ type. The presence of the factor 2 r above, shows that this case can be 
treated exactly as the previous \' a . 

Case 1". This time one observes that when the two functions <EV(£i) and O^i) interact, there 
is no decaying factor coming out of this and all one can say is that 

O r (fr)oJ(ft) = Oj(ft) 

where OqC^i) is another *F function, since ( ,(^i) is adapted on an interval which lies inside the 

one corresponding to O r (£i) (recall that r > now). To produce a decaying factor, one would 
have to argue somewhat differently. 

We will explain the changes that one has to make again in the k = case for simplicity, since 
as usual the argument is scale invariant. Consider the term 
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0g(#Oj(?i)...<ltf +1 (& + i). (47) 

Recall that 0°(£) is of *F type, but it has also compact support, since the Littlewood-Paley 

decompositions have been chosen to be compact for the and d + 1 positions. Pick Oq(£) 
another O function, supported on a slightly larger interval and which is equal to 1 on the support 

ofO°(£). Then, split (47) as 



oj;^)oi(^)...^ +l (^ + i)o;j(^) + 



(48) 



a>°(£)oi(£ 1 )...0^H& + i)[i - (49) 

The (d + 2)- linear form determined by (48) (and its family of analogs for each scale) can be 

treated as in the case and in fact it is even simpler, since this time one has that r (€)® XO = 
unless r = 1, 2, 3 (say). 

To understand (49), we rewrite it as 



0»(^)(D ( 1 ) (^ 1 )...0^ 1 (^ +1 )[Oj|(^) - 0°(#] = 



(50) 



-O{j(0Oj(fl)...Oj{ +1 (& + l) 



C< (d- 

Jo 



s)£ + sg) ds 



f 1 < 

Jo 



(£ + s(a 2 ^ 2 + ... + a d % d ))ds 



(Q-2^2 + ... + aidtjd). 



At this point we have to realize that we will loose another factor of d, because of the paranthesis 
(a 2 ^2 + — + a d t;d)- Each of these 0(d) expressions is of the form 



0g(£)Oj(fi)...Og +1 (& + i) 



f 

Jo 



O!! (£ + s(a 2 & + - + a d % d ))ds 



(51) 



where, for some 2 < j < d one has an extra factor of type a^j in addition to the previous 

Q> J Q (tfjj). Clearly, this just adds another harmless *F function to that j term, so it is enough to 
analize (51). The crucial observation here is to realize that when one hits (51) with a factor 
of type *P r (^)O r (^i), one has to have < s < C/2 r in order for the corresponding term to be 
non-zero. This means that one can simply replace the integral 



f 0° (g + s(a 2 fr + ... + a d td))ds 
Jo 



in (51) by 



f 

Jo 



c/r -x^i 

0° (f + s(a 2 ^ 2 + - + a d £ d ))ds. 
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Now, for each fixed r > and each < s < C/2 r , the corresponding form can be estimated 
exactly as before uniformly in s and in the end, after integration, one obtains an upper bound 

summable over r > 0. The extra factor <D{j + s(a 2 ^2 + ... + a d ^ d )) is of course harmless, since 
it just adds another average to the generic formula, as can be seen from (37). 
This ends Case 1. 

Case 2: j\ = and j 2 = d+l. The goal here is to show that after some calculations, one can in 
fact reduce this case to the previous Case 1 . To understand this, consider again a generic k = 
term, as the one in (47). One can split it as 

where 0q(£i) is of O type, compactly supported at scale one, while <^,(£i) is of type also 
adapted at scale one. Clearly, the B terms generate (d + 2)- linear forms similar to the ones in 
Case 1, and so it is enough to discuss the A terms only. Here it should not be difficult to realize 

that by construction, at least one of the two *P functions <D{j(£) and <Dq +1 (£/ + i), has its support 
away from zero. And moreover, we claim that without loss of generality, one can assume that 

Oo +1 (£/ + i) is that function. To see this, one just has to observe that the roles of the variables £ 
and % d+ i are totally symmetric. Indeed, since £ + £i + ... + = a simple change of variables 
shows that 



[ l R+ (£ + afo + ... + a d ^ d )da x ...da d = I 1 R + B& + ... + B d ^ d )dB x ...dB d 

J[0,l] d J[0,i] d 

which is obviously a similar symbol. 

In particular, one can clearly rewrite A as 

^(^(^uK'i^K 1 ^ + & + - + &) (52) 

for another well chosen compactly supported function V F^ +1 . Then, one rewrites (52) further 
as 

%)0l(^ 1 )...6p 1 (^ + l) X I? ri (^ + «2^2 + - + 

^(^(? 1 )...^ ri (& + i)[^ ri (f + ft + - + &) - + <*2& + - + £*,/&)]. (53) 

Clearly, the first term above can be written as 

Then, one can remark that the symbol 



I l R +(? + ari£i>/a?i 
Jo 
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is a classical symbol on the support of this term and its analysis becomes simpler. In particular, 
one no longer needs to insert the extra decomposition over r to study it. 

We are thus left with the second term of (53). But as before, this term can be written as 

O»(0^(fr)...6jp ^ 4«p ((1 - s%£ + & + ... + &) + s(£ + a 2 ^ 2 + ... + a£ d ))ds ■ 

■(^+(l-a 2 )^ + ... + (l-a d )^ d ). 
Then, as before, this expression splits into a sum of 0(d) terms, each of which containing an 
extra factor of type £j for some 1 < j < d, which transforms the corresponding j'th function into 
a *F one. As a consequence, our generic term will contain another T 1 in an intermediate position, 
exactly as in Case 1 . And again, as before, the presence of the extra factor 

i ^= ' 

V d +1 ((1 - + & + ... + &) + s(f + a 2 £ 2 + ... + a d { d ))ds 
is harmless because of the formula (37). 

Case 3: ji = 2 and 72 = 3. Finally, it is not difficult to see that the Case 3 can be analyzed as 
Case 1, since there are now two *F type functions in intermediate positions. 

This ends our proof since by symmetry, any other case can be reduced to one of these three. 

5. Generalizations 

To be able to describe and motivate the generalizations we mentioned at the beginning of the 
paper, we would first like to recall the classical calculations of Calderon, which gave rise to his 
commutators. 

Calculus with functions of linear growth. Let A(x) be a complex valued function of one real 
variable having linear growth, more precisely satisfying A' e L°°. We denote by Hf(x) the 
classical Hilbert transform given by 

Hf(x):=p.v. f f(x-y)^ (54) 

Jr y 

and by Af(x) the operator of multiplication with A(x). The question that started the whole 
theory, was wheather the commutator [H,A] was smoothing of order one, or equivalently if 
[H, A] o D maps U into U boundedly for 1 < p < 00, where Df(x) := f'(x) 1 . 
For / smooth and compactly supported, one can write 

(HA -AH) o D(f)(x) = (HA -AH)(f')(x) = 
p.v. [ A(x-y)f'(x-y)^-p.v. [ A(x)f'(x-y)^ = 

Jr y Jr y 

p.v. f A(y)f'(y)— dy-p.v. f A(x)f'(y)—dy = 
JR x-y Jr x-y 

7 As a consequence of this, HA could be written as HA = AH + [H, A] and therefore belonged to Calderon's 
algebra. 
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f A(x)-A(y) f ( A(x)-A(y) \ 

-p.v. / (y)dy = p.v. f(y)dy = 

JR x ~y JR\ x-y ) 

f A '<y) f( w M C Mx)-A(y) 

-p.v. f(y)dy + p.v. — —f(y)dy = 

JTLX-y JR (x - yr 

-H(A'f)(x) + CJ(x) 

where Ci is precisely the first Calderon commutator. Since both / — > H(A'f) and C\ are 
bounded on LP for 1 < p < oo, these show that indeed [H, A] is smoothing of order one. 
Besides the commutators, Calderon pointed out that more general operators such as 

f ^ p . v . { F imzm\ m j^ (55) 



x-y 



or even 



.V. f 

JR 



,^ p ,. , F (d2«) G (M^) /W ^ (56 ) 
x-y ) \ x-y ) x-y 

are worthwile to be studied as well, since they appear naturally in complex analysis or boundary 
value problems in PDE 8 . 

The work of Coifman, Mcintosh and Meyer [8] proved the desired estimates for all these 
operators, by reducing them to the previous (3). Finally, let us also remark that the above 
calculations show that if one denotes by a := A', then for g, Dg e LP with 1 < p < oo, one has 

H(ADg) = AH(Dg) - H(aDg) + C lg . (57) 

This is remarkable since apriorily, there is no direct and obvious way to even define H(ADg). 
Notice that on the right hand side, all the compositions make sense. 

Calculus with functions of polynomial growth, Part 1. Suppose now that A(x) is a function 
having quadratic growth, more precisely satisfying A" e L°°. Is it true that the commutator 
[H,A] is still a smoothing operator ? We will see that this time [H,A] is smoothing of order 
two. 

Indeed, for / a smooth and compactly supported function, given also the previous calcula- 
tions, one has 

(HA - AH) o D 2 (f)(x) = (HA - AH) o D(f')(x) = 



pv L 



, l A(x)-A(y) A'(y) 

p - v - LI {x . yf -—) ny)dy 

A(x)-A(y)-A'(y)(x-y) 



(x-y) 2 



f(y)dy = 



Here, the functions F, G are for instance analytic in a certain disc around the origin, while HA'Hoo, ||B'|U are 
supposed to be strictly smaller than the radii of convergence of F and G respectively. And of course, one can 
consider similar operators with more than two factors. 
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- pv L— 
2p - v -L 



-A'(y) -A"(y)(x-y)+A'(y) 
(x-y) 2 

A(x)-A(y)-A'(y)(x-y) 



f(y)dy+ 



H(A"f)(x) + 2p.v 



(x-y) 3 

A(x)-TlA(x)\ f(y) 



(x-yf 



f(y)dy = 

dy 



x-y 



where 



T l y A(x) : =A(y) + -^-(x-y) 

is the Taylor polynomial of order 1 of the function A, about the point y. Since / —* H{A" f) is 
clearly a bounded operator, the problem reduces to proving LP bounds for the linear operator 



/ ~> P-v 



'A(x)-T y l A(x)\ f(y) 



(x-y) 2 



x-y 



dy. 



(58) 



For functions of arbitrary polynomial growth satisfying A {d) e L°° for some d > 1, one can 
similarly show that [H, A] is smoothing of order d, if the operator 



-1( 



A(x)-T*- l A(x)\ 



fiy) 

x-y 



dy 



(59) 



(x-y) d 

satisfies the usual estimates, where Ty ~ l A(x) is the Taylor polynomial of order d - 1 of the 
function A, about the point y. 

More generally, as before, one can ask if the operators 



/ 



Jr 



( A{x)-T^- l A{x)\ 
y (x-y) d 



f(y) 

x-y 



dy 



(60) 



or even 



/ P- v 



B(x)-T^- 1 B(x)\ 



H 



( C(x)-T?- l C(x) 
(x - yY 2 



fiy) 



x-y 



dy 



(61) 



(x - y) d> 

are bounded on LP for 1 < p < oo, assuming that F,G,H are analytic as before, while 
A (d) , C (rf2) are all in L°° 9 . As a consequence of the method in this paper, one can an- 
swer all these questions affirmatively. Indeed, it is not difficult to see, thanks to the general 
averaging formula for the rest of a Taylor series, that as before, the problem reduces to proving 
LP x L°° x ... x L°° -> LP estimates for the (k + l)-linear operator with symbol 

— ^- f sgn(£ + + ... + a*&)(l - a x ) d ...{I - a k ) d da\...da k 



And in fact, even more generally, one can consider operators having an arbitrary number of similar factors. 
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which grow at most polynomially in k. But this symbol is very similar to the symbol of the 
Mi Calderon commutator and because of this, one can prove the desired estimates similarly. In 
fact, for k = 1 one can actually see that 

f sgn(£ + a?i£i)(l -a{fda x 
Jo 

is even better than the symbol of the first commutator, since at least along the line £ + f i = it 
becomes smoother, due to the presence of the extra factor (1 - a\) d . In particular, the quadratic 
estimates for its Fourier coefficients are still available. 

It is quite likely that the T\ theorem of David and Journe [9] can be used to handle the cases 
when F, G, H are of the form x n for some positive integer n. In fact, for F{x) = x this has been 
verified in [7] 10 . The more general case G(x) = H(x) = x has also been treated directly in [4]. 

Calculus with functions of polynomial growth, Part 2. There is an alternative calculation 
that one can perform to understand the previous question. Let us first observe that [H,A] is 
smoothing of order two if and only if [H, A] o H is smoothing of order two. Then, using the fact 
that H 2 = -I, one can write 

(AH - HA) o H( f) = AH 2 f - H(AHf) = 

-Af - H(AHf) = X - (H 2 (Af) + AH 2 f - 2H(AHfj) . 
If one ignores the \ factor, the above paranthesis (calculated at an arbitrary point x) becomes 

f .a, .dtds C dtds 

p.v. f(x + t + s)A(x + t+s) — —+p.v. f(x + t + s)A(x)— 

JR 2 t s Jr 2 t s 

J n dt ds C dtds 

f(x + t+s)A(x + t)— p.v. f(x + t + s)A(x + s)—— = 
R 2 t s Jr 2 t s 

J r dtds 
f(x + t + s) (A(x + t + s)- A(x + t)- A(x + s)+ A(x)) . 
R 2 t s 

To see if the last expression is smoothing of order two, one has (after integrating by parts with 

respect to the t variable) 



?V JR 2 



xl A(x + t + s)-A(x + t)- A(x + s)+ A(x) \ , 
f (x + t + s) ( I dtds = 



A' (x + t + s) — A' (x + t)\ 
(x + t + s) ( | dtds+ 



pv L f 



A(x + t+ s)-A(x + t)-A(x + s)+A(x)\ 
(x + t + s) ( — I dtds := 



A + B. 

Integrating by parts with respect to the s variable now, one can rewrite A as 



10 It is also interesting to see in [7] at page 94, another instance where these operators appear naturally. 
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,, dtds 

f(x + t + s)A (x + t + s) 

t s 



A (x + t + s) - A'(x + s) , dt 
+ t + s)— -ds— = 

s 2 t 

-A"(x)f(x)-H(C lA ,f)(x), 
where C\ A > is the Calderdn first commutator associated to the Lipschitz function A'. 
Similarly, one can rewrite B as 



A(x + t + s)- A(x + t) - A(x + s) + A(x) 
t 2 s 2 



-H(C lA ,f)(x) + p.v. J^ 2 f(x + t + s) " y f2 ' a " v " ' ' " x " 7 dtds = 

-H(C lA ,f)(x)+ 



J n (A, A, \dtds 

2 f(x + t + s) Mo-AOt) --, (62) 
R \t s ) t s 

where in general, A h g(x) denotes the usual finite difference at scale h given by 

&hg(x) = g(x + h)- g(x). 
The problem can be therefore reduced to the one of proving V estimates for the linear operator 
in (62). The reader may remember it from our previous paper [18]. If A has arbitrary polynomial 
growth (i.e. A {d) e L°° for some d > 1), then an analogous calculation shows that [H,A] is 
smoothing of order d, if the operator 

/ - P.v. f d f(x + t 1 + ... + t d ) o ... o ^A(x)) d X d J± (63) 

is LP bounded. 

And then more generally, one can ask the same question about the operator given by the 
expression 

p.v. f(x + h + ... + t d )F^ o ... o ^A(x)) d X.^± (64) 



td / h td 



or by (in the case of two factors) 



P- v - L d f( x + t i + - + td)G\—— o ... o ——B(x) \H\—- o ... o -—C(x) — 

h t d ] \ t\ t d / t l t d 



(65) 



where (aj)j, (bj)j are all different from zero real numbers and F, G, H analytic functions as 
before. It is also interesting to compare these formulae with the classical expression (55), which 
can be rewritten as 



p.v. I 
JR 



,A. \ dt 
f(x + t)F\^A(x)\-. 
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We now claim that essentially without any extra effort, one can prove the desired estimates for 
all these operators. Indeed, using that 

A A r 1 r 1 

— o ... o — A(x) = I ... I A (d \x + a x ti + ... + a d t d )da x ...da d , 
h td Jo Jo 

it is not difficult to see that (64) can be reduced to the study of the (k + l)-linear operator with 

symbol 11 



J[0,1]* 



d 

(£ + aigi + ... + a k ^ k )dai...da k \ (66) 



while the more general (65) to (k + l)-linear operators whose symbols are products of type 

sgn(£ + c\ol£x + ... + c l k a k ^ k )da l ...da k \ (67) 



[J (I 



[0,1]* / 

for various non-zero real numbers (c' ; ), ,-. The only fact that needs to be realized at this point is 
that the method extends naturally to cover product symbols of type (66) and (67) as well, since 
each individual factor can be decomposed as before, as a Fouries series with Fourier coefficients 
that decay at least quadratically. More specifically, the only difference in the argument is that 
instead of the Littlewood Paley decomposition in (28), which works very well in the d = 1 case, 
one has to consider a product of d such similar decompositions, each naturally corresponding 
to the factors of (67) 12 . 

Extended Calderon algebra. Given the previous discussions, it is also natural to consider 
operators of type 



f^p.v. I f( x + t)F\^o^A(x))^ (68) 



or even 



/ -> P-v. JT f(x + t)G (jB(x)j h[^o ^C(x)j j (69) 

and so on, and ask if they are LP bounded if F, G, H axe analytic and A", B' , C" e L°°. They can 
also be treated by the method of this paper. For instance, the study of (68) can be reduced to the 
study of multilinear operators with symbols of type 

I sgn(£ + (ori +>8i)£i + - + (<*k + B k )% k )dadB 

J[0,lp 

1 Notice that this is precisely the symbol of the kth Calderon commutator raised to the power d ! 
For instance, for d — 2, one takes a product of two 2-dimensional Littlewood Paley decompositions of type 
(28), one for the pair of variables (£i,£) corresponding to the first factor and another one for the pair of variables 

(£i , £) corresponding to the second factor. 



26 



CAMIL MUSCALU 



which clearly can be analysed in a similar manner. Are they related to Calderon 's algebra in 
any way ? one might ask 13 . To be able to answer this, we need to recall the bilinear Hilbert 
transform [13], [14]. 

If a is any real number with a ± 0, 1 , the bilinear Hilbert transform with parameter a denoted 
by BHT a , is the bilinear operator defined by 

BHT a (f,g)(x) := p.v. f f(x + t)g{x + at)%. (70) 
JR t 

It is known that these operators satisfy many LP estimates of Holder type [13], [14]. As a 
consequence of the results in [13], [14] and of the U theorem for (68) in the particular case 
F(x) = x, one obtains by a straightforward calculation that 

BHT 2 (f,A)(x) - 2HAf(x) + AHf(x) 
is a smoothing of order two expression, if A" e L°°. In particular, given also the earlier calcula- 
tions with functions of quadratic growth, one has that 

BHT 2 (f,A) = AHf + smoothing of order 2 term, 

which shows that the operator / — > BHT 2 (f,A) belongs to Calderon's algebra 14 . Also, the 
parameter 2 above can be replaced by any other a, as a consequence of the fact that the theorem 
for (68) holds true if one replaces y o ^ by —f- o ^ for some appropriate constants a, b. 

Finally, let us also remark that one can add derivatives freely to the general operators in (64) 
and (65) as well and still obtain bounded on LP operators. A typical example would be 

f -> p.v. f(x + t + s)F\—o — o —A{x) G — o — o — o —B(x) — — 

JR 2 \t t s I \t s s s j t s ^ 71 ^ 

for A'", B"" e L°°. The straightforward (by now) details are left to the reader 15 . 

Circular commutators. In [18] we noticed a certain symmetric bilinear operator which we 
named circular commutator. 

We would like to describe its natural trilinear generalization here 16 . Thus, we record the 
following theorem. 

Theorem 5.1. Let a ± 0, b ± and c ± be three fixed real numbers. Consider also three 
Lip schitz functions A, B and C. Then, the following expression 

JR 3 \ h )\ h )\ h J h h h 

1 -J 

Generally speaking, we say that an operator belongs to Calderon's algebra, if it can be written as a sum 
between a classical Fourier integral operator with a well defined symbol a(x, £) (such as AH for example) and a 
smoothing of a certain order term. 

14 An even simpler calculation shows that if A' e L°°, then BHT2(f,A) = AHf plus a smoothing of order one 
term. 

15 And the same is true if one considers the natural generalizations of them, when one may face more than two 
kernels and more than two factors in the corresponding expressions. 

16 There are of course many other multilinear generalizations of this, as the reader can imagine. 
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viewed as a trilinear operator in A', B' and C maps L Pl x LP 1 x LP 3 into LP boundedly, for every 
1 < pi,p2,P3 < 00 with 1/ Pi + l//?2 + 1/^3 = l//> 1/2 < p < 00. 

The proof of this theorem uses again the same method. If we assume for simplicity that 
a = b = c = 1 , then the symbol of the corresponding trilinear operator is given by following 
circular product 

ra 2 (£i, £>,£s) • ra 2 (£>,£s,£i) 1 W2(6,fi,&) 
and such symbols can be clearly treated in the same manner. 

T 1 calculations. It is very well known that the T 1 theorem [9] can handle Calderon commuta- 
tors quite successfully. More precisely, it allows one to reduce the study of C* to the study of 
Ck-i ■ Let us briefly recall the details, in the particular case of the first two commutators. 
Assume that A is smooth, compactly supported and A' e L°°. Rewrite C\a as 



A(.v + /)-A(.v) 



Then, integrating by parts, one gets 

Ci, A l(x) = p.v. A(X + ^ 2 ~ A(X) J? = -p.v. + -A(*))|jj dt = 

p.v. f A'(x + t)— =H(A')(x) 
JR t 

which is a BMO function. 

Suppose now that A, B are smooth, compactly supported and A', B' e L°°. As before, rewrite 
the second commutator associated to A, 5 as 

^ , f „ JA(x + t)-A(x))(B(x + t)-B(x)) 
C 2 ,a,bJ(x) = P-v. f{x + t) -3 dt. 

Then, one has 

r u , f (A(x + t)-A(x))(B(x + t)-B(x)) 

C 2 ,a,bKx) = P-v. = dt = 

JR t 3 



(A(x + t) - A(x))(B(x + t) - B(x)) I i I dt = 



1 r Al/ B(x + t)-B(x) J 
-p.v. A'(x + t)— — dt+ 

2 Jr t 2 

1 r A(x + t) - A(x) 
2 P ' V ' Jr 7 2 

X -C hB (A')(x) + X -CUB')(x) 

which are both BMO functions. Similar calculations can be performed to more general commu- 
tators of type 
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f " Jr f{x + (ir Mx) ) - (ir Mx) ) i (72) 

for every sequence (aj)j of nonzero real numbers. 

Even though the operators of the previous sections do not come in a standard form (in partic- 
ular, their kernels have a product structure) it is still tempting to check if there is an analogous 
Tl type reduction of complexity, available for them. 

Consider this time A, B smooth and compactly supported functions, satisfying A", B" e L°°. 
The simplest analogue of the first commutator, is the operator 7\ A given by 

Ti, A f(x) = p.v. f(x + t+ s) — o — A(x) — — 
JR \ t t ) t s 

for a, b real and nonzero. One can write 

„ . , . C A(x + at + bs) - A(x + at) - A(x + bs) + A(x) , , 

Ti,aKx) = p.v. — dtds = 

JR t 2 s 2 

p.v. J 2 (A(x + at + bs) - A(x + at) - A(x + bs) + A(x)) | y j |- j <i?J5 = 

p.v. f A"(x + at + bs)--dtds = Ho H(A")(x) = -A"(x) 
JR 2 t s 

which is in L°° and therefore still in BMO. 

However, as we will see, the problem becomes more complicated and the symmetry gets 

broken at the next step, when one considers the analogue of the second commutator T 2 a b given 

by 

WW = P.v. /(* + *+ s) o ^A(x)) o ^B(x)) f ^ (73) 
for a, b, c, d real and nonzero numbers. This time, one has 

T 2 ,a,bKx) = ^p.v. J^ 2 ( Aat ° &b s A(x))(A ct o A ds B(x)) |-j-J dtds. 

One particular term that appears after integrating by parts, is the one which corresponds to the 
situation when a pair of s and t derivatives hits the first term and another similar pair hits the 
second term. The operator obtained in this way is 

, , dtds 

A (x + at + bs)B (x + ct + ds) 

R 2 t s 

which is clearly of a BHT a j3 type and it is unlikely that such operators map L°° x L°° into BMO. 

Even worse, if instead of (73) one considers its natural generalization with 4 factors T^ AuAlA3A4 , 

a similar calculation generates the expression 

p.v. f A"(x + a\t + bis)A f 2(x + a 2 t + b 2 s)A"(x + a 3 t + b 3 s)A'^(x + a 4 t + b 4 s) — — 
JR 2 t s 
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and it is known that for generic choices of (aj)j and (bj)j this 4-linear operator does not satisfy 
any U estimates of Holder type [19]. 
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